Language agnostic missing subtitle detection
نویسندگان
چکیده
Abstract Subtitles are a crucial component of Digital Entertainment Content (DEC such as movies and TV shows) localization. With ever increasing catalog (≈ 2M titles) localization expansion (30+ languages), automated subtitle quality checks becomes paramount. Being manual creation process, subtitles can have errors missing transcriptions, out-of-sync blocks with the audio incorrect translations. Such erroneous result in an unpleasant viewing experience impact viewership. Moreover, correction is laborious, highly costly requires expertise languages. A typical process consists (1) linear watch movie, (2) identification time stamps associated blocks, (3) correcting procedure. Among three, taken to entire movie by human expert most consuming step. This paper discusses problem transcription, where corresponding some speech segments DEC non-existent. We present solution augment automatically identifying timings non-transcribed dialogues language agnostic manner. The step then be performed either human-in-the-loop mechanism or using neural transcription (speech-to-text same language) translation (text-to-text different languages) engines. Our method uses voice activity detector (VAD) classifier (AC) trained explicitly on corpora for better generalization. three steps: first, we use VAD identify (predicted blocks). Second, refine those AC module removing leading trailing non-speech identified VAD. Finally, compare predicted dialogue file (subtitle blocks) flag transcriptions. empirically demonstrate that proposed (a) reduces 10%, (b) improves 2.5%, (c) false positive rate (FPR) overextending 77%, (d) block-level precision 119% over baseline human-annotated dataset blocks.
منابع مشابه
Language Agnostic Dictionary Extraction
Ontologies are dynamic artifacts that evolve both in structure and content. Keeping them up-to-date is a very expensive and critical operation for any application relying on semantic Web technologies. In this paper we focus on evolving the content of an ontology by extracting relevant instances of ontological concepts from text. The novelty of this work is that we propose a technique which is (...
متن کاملTowards Language-Agnostic Mobile Code
The Java Virtual Machine is primarily designed for transporting Java programs. As a consequence, when JVM bytecodes are used to transport programs in other languages, the result becomes less acceptable the more the source language diverges from Java. Microsoft’s .NET transport format fares better in this respect because it has a more flexible type system and instruction set, but it is not exten...
متن کاملLanguage-Agnostic Relation Extraction from Wikipedia Abstracts
Large-scale knowledge graphs, such as DBpedia, Wikidata, or YAGO, can be enhanced by relation extraction from text, using the data in the knowledge graph as training data, i.e., using distant supervision. While most existing approaches use language-specific methods (usually for English), we present a language-agnostic approach that exploits background knowledge from the graph instead of languag...
متن کاملLanguage-Agnostic Event Detection Across Sports from Twitter Using Temporal Features
A significant body of work focuses on detecting events in social media, much of which relies on restrictive assumptions like prior event-centric filtering or known language models. Rather than build these expensive pre-processing and filtering pipelines, we explore language-agnostic techniques using only temporal features like token frequency. We construct temporal features for tokens in an unf...
متن کاملAdaptive context-agnostic floor transition detection
We present a technique for detecting floor changes in an indoor environment and improving pedestrian indoor localization and navigation. Our technique relies on barometric pressure sensors commonly available on smartphones and tablets. We developed an algorithm running on smart mobile devices that can be integrated in any indoor localization system to improve accuracy and support 3D navigation....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Eurasip Journal on Audio, Speech, and Music Processing
سال: 2022
ISSN: ['1687-4722', '1687-4714']
DOI: https://doi.org/10.1186/s13636-022-00244-9